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ABSTRACT 

This report examines the limitations of research that 
has been conducted on the effects of retention and social promotion. 
It reexamines seven research studies that provide strong evidence 
based or. strong methodology that includes the following three 
characteristics: (1) the studies identify the basis of comparison, 
that is, whether they are comparing the achievement of retained 
versus promoted students at the same age or same grade level; (2) the 
studies identify the specific ;:ind of educational program that 
students receive after eitner reten* on or promotion; and (3) the 
studies examine the long-term effec^. of retention and promotion. It 
is concluded that neither retention uor social promotion is a 
satisfactory response to the need to provide appropriate instruction 
for low performing students. Two tables present summaries of tne 
studies and their effect sizes. A 29-item list of references is 
included. ( Author /SLD) 
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The Center 



The mission of the Center for Research on Effective Schooling for Disadvantaged Students 
(CDS) is to significantly improve the education of disadvantaged students at eachTevel of 
schooling through new knowledge and practices produced by thorough scientific study and 
evaluation. The Center conducts its research in four program areas: The Early and Elementary 
Education Program, The Middle Grades and High Schools Program, the Language Minority 
Program, and the School, Family, and Community Connections Program. 

The Early and Elementary Education Program 

This program is working to develop, evaluate, and disseminate instructional programs capable 
of bringing disadvantaged students to high levels of achievement, particularly in the fundamental 
areas of reading, writing, and mathematics. The goal is to expand the range of effective 
alternatives which schools may use under Chapter 1 and other compensatory education funding 
and to study issues of direct relevance to federal, state, and local policy on education of 
disadvantaged students. 

The Middle Grades and High Schools Program 

This program is conducting research syntheses, survey analyses, and filed studies in middle 
and high schools. The three types of projects move from basic research to useful practice. 
Syntheses compile and analyze existing knowledge about effective education of disadvantaged 
students. Survey analyses identify and describe current programs, practices, and trends in middle 
and high schools, and allow studies of their effects. Field studies are conducted in collaboration 
with school staffs to develop and evaluated effective programs and practices. 

The Language Minority Program 

This program represents a collaborative effort. The University of California at Santa Barbara 
lsfocusing on the education of Mexican-American students in California and Texas; studies of 
dropout among children of immigrants are being conducted at Johns Hopkins, and evaluations of 
learning strategies in schools serving Navajo, Cherokee, and Lumdee Indians are being conducted 
by the University of Northern Arizona. The goal of the program is to identify, develop, and 
evaluate effective programs for disadvantaged Hispanic, American Indian, Southeast Asian, and 
other language minority children. 

The School, Family, and Community Connections Program 

This program is focusing on the key connections between schools and families and between 
schools and commtnities to build better educational programs for disadvantaged children and 
youth. Initial work is seeking to provide a research base concerning the most effective ways for 
schools to interact with and assist parents of disadvantaged students and interact with the 
community to produce effective community involvement. 
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Abstract 

This report examines the limitations and inadequacies of research that has been conducted 
on the effects of retention and social promotion, and re-examines the research studies that provide 
strong evidence based on strong methodology that includes three characteristics: ( 1 ) the studies 
identify the basis of comparison, that is, whether they are comparing the achievement of retained 
vs. promoted students at the same age or at the same grade level; (2) the studies identify the 
specific kind of educational program that students receive after either retention or promotion; and 
(3) the studies examine the long-term effects of retention and promotion. The report concludes that 
neither retention nor social promotion are satisfactory responses to the need to provide appropriate 
instruction for low performing students. 
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Introduction 



Perhaps no decision about a child's education is 
fraught with as much emotion and confusion as 
the decision to promote a low achieving child to 
the next grade or to retain that child in the same 
grade. Current research is vividly pointing out the 
negative consequences of grade retention 
(Shepard and Smith, 1989). Despite the research 
results pointing to negative effects and the costs 
to districts of grade retention, ihe practice contin- 
ues. In fact, retention in some areas (specifically 
kindergarten) appears to have dramatically in- 
creased during the 1980s. In many urban dis- 
tricts, the cumulative effect of grade retention is 
such that 1 in 2 students will repeat a grade by the 
third grade. 

Historically, grade retention has been associated 
with later lack of success in school and with 
eventual dropping out of school (Grissom and 

Overview of Issues in Grade 

The need to make decisions about promotion and 
retention arose with the introduction of age- 
graded schooling in America in the 1840s. With 
the introduction of graded classes, the question of 
standards for promotion from grade to grade be- 
came an issue. 

Throughout the years, toughening or loosening 
the standards has rejected tie political and reform 
climate of the particular era. In tim?s when 
scnools are under pressure to improve perfor- 
mance - for whatever reason - there is an in- 
evitable emphasis on standards and on the need to 
lighten requirements for promotion from grade to 
grade. When schools lack credibility in the eyes 
of the public, tightening standards is one very 
public way by which schools can appear to be re- 
sponsive. 

Shepard and Smith (1989) find that the emphasis 
on social promotion or on retention has varied 
markedly across the history of public schooling in 
the United States. In the 1800s rates of grade 
repetition were extremely high, affecting as many 
as 70 percent of all students in any one year. 
There is historically, as well as currently, little 
systematic evidence on rates of retention, only il- 
lustrative examples. For instance, in Iowa in the 
early 1900s, about 50 percent of the students 
were retained each year. This figure declined to 
about 25 percent in the 1930s and eventually to 
around 10 percent in the 1960s. Also, retention 



Shepard, 1989). Disadvantaged students, males 
and minorities are more likely to be retained in 
grade. Grade repetition is a major, commonly 
occurring event in the educational histories of less 
successful students. 

This report provides a brief overview of issues 
involved in retention/s^al proi lotion policies, 
describes the limitations «j*d mr Jequades of re- 
searcn on the issues, ami re-examines the re- 
search studies which provide the best evidence 
about the effects of grade retention on students. 
A final section discusses how the research on 
grade retention is linked to general efforts at 
school reform and attempts to cope with student 
diversity in the classroom. 



Retention and Social Promotion 

rates varied widel> across die United Slates at any 
given time. A 1909 study by Ayres indicated that 
one Massachusetts school district retained 7.5% 
while a Tennessee school district retained 75.8% 
of its students each year. 

In the 1930s educators recognized that grade 
repetition might endanger students' social and 
emotional development, which gave rise to the 
practice of social promotion. As a result of this 
policy, students were passed on to the next grade 
even if they were not ready for die work. Thus 
the proportion of overage students at each grade 
level declined from 1918 until 1952. However, 
providing homogeneous groups for instruction - 
which was accomplished by retention was 
simply met by other avenues. Grouping and 
tracking practices rose accordingly, and dropout 
rates increased. 

The current educational reform movement has 
seen an increased focus on standards and corre- 
sponding increased rates of grade repetition. The 
Nation at Risk (1983) report, which captured the 
spirit of the reform movement, specifically called 
for increasing attention to standards and ad- 
vancement to the next grade on the basis of aca- 
demic progress and not on age. Nineteen states 
have since established specific standards for 
grade promotion and graduation requirements. 
That such standards lead to greater rates of reten- 
tion is exemplified by tl.c experience of the 
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Atlanta public schools. In 1981, after instituting 
minimum competency requirements, the retention 
rate was four tunes the rate in 1980. 

The current cumulative rates cf grade repetition 
are in many instances as high as they were before 
social promotion became popular (Shepard and 
Smith, 1989, Table 3). Many states now have 
annual rates of grade repetition of about seven 
percent. If most students are not retained in more 
than one grade, this rate of seven percent in one 
particular year generates cumulative rates of at 
least 50 percent, which is similar to the rales prior 
to Ae advent of social promotion. 

The Basts for Retention as a Policy 

Two themes serve as the basis for retention as 
educational policy . The first theme is most often 
encountered in decisions regarding retention or 
extra-year placements in kindergarten and first 
grade. The basis for retention at this grade level 
is typically student immaturity. Students given 
the gift of time" - repeating the same grade - 
have an opportunity to mature. Such authors as 
Gesell (1982) and Ames (1966, 1980) advocate 
testing to determine a child's developmental levei 
and placement of the child in school on the basis 
of his or her developmental, not chronological, 
age. 

This viewpoint has been particularly influential in 
pre-kindergarten and kindergarten years, where 
students are retained because of "immaturity" or 
simply not being behaviorally ready for school. 
This philosophical approach specifically denies 
that intervention to improve student maturation or 
other deficits is possible or desirable. These 
programs follow a maturational philosophy in 
which readiness for school work is a quality 
which unfolds on its own timetable, not to be 
rushed or pushed. 

Research and 

Although literally hundreds of reports, reviews, 
dissertations and essays have examined the bene- 
fits and drawbacks of grade repetition, they pro- 
vide limited evidence about whether grade repeti- 
tion will actually help or harm a student Four is- 
sues limit the uti'iry of most of the research. The 
first limitation is the design of the research 
(Jackson, 19^5). The second limitation is that 
studies fail to identify the basis of comparison or 
they improperly combine and aggregate results 
that use different bases of comparison. 



The second theme is that low achievement is 
caused by a lack of exposure to the material being 
taught ami thus can be remediated by recycling 
the student through the material. In this theme, 
student "failure" is not deemed a fsilureof the ed- 
ucational system, but of ule student. The student 
needs more time to mature or additional time to go 
through the material. Rarely is it assumed mat the 
approach or content is inappropriate for the 
learner, rather, it is assumed that the learner is in- 
appropriate for the material being presented. 

A policy of recycling a student through the mate- 
rial also assumes that learning is a linear process 
and that mastery of content at one level depends 
upon mastery at a previous level. It assumes that 
what is to be learned must be broken down into 
sequential steps and that mastery of the whole is 
only achievable by mastery of the parts. 

Failure is also often seen as a positive experience 
for children because it motivates them to succeed. 
Fear of failure is often offered as a reason to have 
grade repetition as a policy. 

Who is Retained? 

The numerous studies of grade retention provide 
a consistent profile - the retained student is more 
likely to be male, to be younger than his class- 
mates, to come from a lower socio-economic 
family background, to be black or Hispanic, to be 
a behavior problem, and to be immature. Also, 
students are more likely to be retained in the 
South than in any other region. Students are 
more likely to be retained at specific transitional 
points, such as kindergarten or fust grade (school 
entry) or grade six (exit from elementary and en- 
try into middle school), or grade nine (high 
school entrance). 



Its Limitations 

The third limitation is that the studies fail to iden- 
tify the educational practices) called "retention" 
or they inappropriately combine studies which 
use very different practices. For example, meta- 
analyses often combine the effects of studies of 
programs which simply recycle students with 
studies of programs which provide specialized 
remedial assistance. 

The fourth limitation is that many studies fail to 
examine the longitudinal effects of retention, 
which could help determine why early grade re- 
tention is such a powerful indicator of later school 



failure. Also, a good (teal of research focusing 
on educational practices in the early years has 
documented the fade out phenomenon - that is, 
initial effectiveness followed by gradual loss of 
effects. Retention seems to exhibit this phe- 
nomenon, but this needs to be clarified through 
tarptudinal studies. 

The design limitations discussed by Jackson 
(1975) are typically acknowledged in discussions 
of grade retention, but the difficulties imposed by 
failure to identify the basis of comparison, to de- 
scribe the educational practice under study, and to 
look for longitudinal effects are not readily dis- 
cussed in the current literature. 

Research Design 

Jackson's (1975) review of grade repetition cate- 
gorized studies by their methodology and pointed 
out the influence of design on the results of the 
studies. Jackson reviewed 44 studies conducted 
from 191 1 through 1973. He classified the avail- 
able studies into three study designs: 

Design Type I: Studies which compared the re- 
tained students with promoted students. This 
type of design is biased in favor of promoted 
children. Those students who are promoted are 
not likely to have the same academic and social 
problems as those who are being retained or else 
they would not be promoted. The existence of 
pre-existing differences in the two populations 
prior to the event of retention which are not con- 
trolled for in the analyses invalidates the results of 
Type I design. 

Design Type II: Studies which compared retained 
students before and after retention. This type of 
design is biased in favor of the retained students. 
This design does not compare retained and pro- 
moted students, but shows only the effects of 
spending two years learning material which was 
intended to be learned in one year. It is reason- 
able to expect that students will make more 
progress the second time around, so this type of 
study is biased in favor of finding effects for re- 
tention. These studies do not control for other 
factors which positively affect growth. 

Design Type III: Studies which randomly as- 
signed equivalent students to promotion or reten- 
tion. These studies provide the best evidence for 
effectiveness. Jackson found only three in this 
category (Cook, 1941; Farley, 1936; Klcne and 
Branson, 1929) and the most recent was pub- 
lished in 1941. Cook (1941) looked at the results 
of retention in a year-long study of students in 
grades 1-7 and found no significant differences. 



Farley (1936) examined the effects of retention in 
a semester-long study and found that the pro- 
moted students outperformed the retained stu- 
dents. Finally, Klene and Branson (1929), in 
their study of retention and promotion of second 
through sixth graders, found that the promoted 
students fared better than the retained students, 
but did not report levels of significance or enough 
data to allow computation of levels of signifi- 
cance. 

Thus, these three key studies produced only one 
significant effect and that favored the promoted 
group over the retained. Jackson interpreted 
these results to mean that no valid research results 
showed the positive effect of retention. Much the 
same conclusion has been reached by more recent 
research syntheses (Holmes, 1986; Holmes and 
Matthews, 1984) which have included studies 
published since Jackson's review. Even more re- 
cent studies, however (such as Shepard and 
Smith, 1989), not only conclude that grade repe- 
tition has not shown a benefit, but that it is 
harmful. 

Basis of Comparison 

The current meta-analyses of grade repetition 
have combined studies that focus on different 
liases of comparison. Students who are retained 
in grade, by virtue of their retention, wii! spend 
more time to attain the same grade in school as 
their same agemates. The question is whe'jser the 
progress of students should be compared after 
they have spent the same time in school (same 
time, but different grades) or after they are in the 
same grade (same grade, but different time). 
Although recent meta-analyses (Holmes, 1989; 
Shepard and Smith, 1989) present results sepa- 
rately for same age and same grade comparisons, 
they ultimately combine the results across com- 
parisons and treat the differences in effects as a 
methodological, not substantive, issue. 

Educational Practice 

The approximately 800 studies of grade repetition 
essentially examine the same questions again and 
again, albeit in different decades, different dis- 
tricts, and for different grades: Does grade repe- 
tition hurt or harm students? Are at-risk students 
better off with social promotion or retention? But 
embedded in these studies are very different edu- 
cational practices that are all being called 
"retention." Transition room placement, devel- 
opmental kindergartens, partial grade retention, 
complete grade repetition, and alternative pro- 
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grams are just some of the treatments reviewed 
collectively as grade repetition. 

At least four types of educational practice have 
been grouped together under the heading grade 

repetition: 

• recycling: repeating the grade, but receiving 
no additional resources or special program 

- alternative after failure: repeating the grade, 
but receiving additional help and special programs 

-aUertuaive pre failure: being placed in an ad- 
ditional-year program prior to actual failure 

- partial promotion: being failed or promoted 
in certain subjects only 

Most studies of the effect of grade repetition have 
mixed the types of programs under one generic 
name and not looked for distinct effects by type 

Best Evidence on the 

The intention of (his paper is not to review again 
every study done on grade repetition to arrive at a 
new estimate for the effect for grade retention. 
Instead, the intention is to reexamine those stud- 
ies which provide the best evidence of the effects 
of retention keeping in mind the limitations dis- 
cussed by Jackson (1975) and the additional dif- 
ficulties just discussed - the need to identify the 
basis of comparison and the educational practice 
being examined and the need to look for short 
term and long term effects. 

The consensus of several extensive reviews of 
grade retention (Jackson, 197S; Holmes, 
1986,1989; Holmes and Matthews, 1984) is that 
there is not a positive effect for grade retention on 
academic achievement or on student personal ad- 
justment. Holmes (1989) summarizes the posi- 
tion aptly in a recent meta-analysis: "The weight 
of empirical evidence argues against grade reten- 
tion.'' As Holmes and Matthews concluded in 
1984, "Those who continue to retain pupils at 
grade level do so despite cumulative research evi- 
dence showing that the potential for negative ef- 
fects consistently outweighs positive outcomes." 

Shepard and Smith take a more strident view, as- 
serting that "retentions do nothing to promote the 
achievement of the affected individuals or the av- 
erage of ,*he group as a whole and because the 
disadvantaged and minority children are most apt 
to be affected, retention should best be thought of 
as educational waste and a denial of life chances 



of retention employed. Often it is not possible to 
tell what happened in the year of retenuon. Given 
the lack of clarity of the "treatment," it is suspect 
to make conclusions about effectiveness or inef- 
fectiveness. If USL study ever found a positive 
effect, we might conclude that all things lumped 
under the heading "retention" were ineffective. 
But some studies do find positive effects, so we 
need to examine the variation in findings to see if 
there are practices in common across these posi- 
tive studies. 

Longitudinal Comparisons 

Studies also need to consider long-term effects of 
retention. Many educational practices, such as 
participation in certain preschool programs, have 
been shown to have an effect in the first year fol- 
lowing the treatment that eventually levels out or 
washes out completely. The long-term effects of 
retention need to be looked for as well as more 
immediate effects. 

Effects of Retention 

to those who most need the benefits of education. 
Retention has high cost and virtually no value, 
save the public relations advantages for the 
schools." (p.235). 

Despite the conclusions of these and other re- 
searchers, districts continue to retain children in 
grade. Although retention in grade does benefit 
districts by raising average grade level test scores 
(Walker and Levine, 1988 ), districts probably 
continue to retain because teachers and principals 
and parents believe it is in the best interest of the 
individual child. Shepard and Smith (1989) sug- 
gest that districts continue to retain because they 
are unable to carry out experiments to compare a 
particular child's progress when retained with 
what it would be had he been promoted. That is, 
school districts do not have a valid bests for com- 
parison. 

What is the Basis for Comparison? 

School districts are not the only parties potentially 
confused fibout the basis of comparison. Most 
research poorly identifies who and on what basis 
the comparison is being made. Are they compa- 
rable children? Are the comparisons made after 
equal time in school and unequal grade? Or, are 
the comparison made after unequal time in 
school and equal grade? Are comparisons made at 
one time point or multiple ones? Arc affective as 
well as achieveme nt measures compared? 
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The meta-analysis conducted by Holmes (1989) 
provides some useful data to address the compar- 
ison basis issue. He presents average effect sizes 
for studies which compared the retainees and 
non-retainers after equal time in school (same 
age, different grade) and after equal grade in 
school (same grade, different ages). The first 
comparison indicates what happened to the 
achievement after retainees have spent another 
year in school, compared to their same age peers 
who am now in a different grade. The second 
comparison indicates how the retained children 
performed when they reached the same grade, 
although they are now a year older. 

Holmes determines that the first comparison 
(same year) favors the promoted group. The 
average effect size is -.45, indicating that the 
retained group is nearly a half a standard 
deviation below their same age peon (who are 
also a grade ahead of them). This finding is 
consistent with Jackson's predictions about the 
manner in which methodology affects outcomes. 
Although the control groups were matched on a 
number of factors, the two groups no doubt 
continue to be different on a number of factors 
affecting the decisions to be promoted or retained. 
No matter how many factors students are matched 
on. there are always unmeasured factors at work 
which may favor the promoted group. The 
negative findings for retention in the same year 
comparison include some unknown amount that 
is due to unmeasured initial differences. 

When one looks at the same grade comparison, a 
different pattern is presented The retained stu- 
dents are higher in the first year (average effect 
size <= +.25), but the effects diminish over time 
until by the third year, they are no longer discer- 
nible. There is an initial boost followed by a fade 
away phenomenon. 

"ilw year and grade comparisons are both inter- 
preted by Holmes to indicate that retention is not 
an effective policy. The same year comparisons 
clearly are negative and the same age compar- 
isons, while positive, fade away in time. Holmes 
combines these effect sizes into a global estimate 
of the effect of retention. 

But, does it make sense to combine studies which 
measure same grade and same year comparisons? 
What educational significance should be afforded 
the same grade vs. same year comparisons? In 
part, the answer to that question depends upon 
the goal one sets for retention. Are students ex- 
pected to be remediated by the retained year and 
be up to the same level as their agemates? Or, are 



students expected to need additional time and at 
some point to be at the same level as their class- 
mates? This question of intention is of course 
central in the evaluation of retention as a policy. 
It is also possible that there are different answers 
in different districts and at different levels of 
schooling. We look now at selected studies which 
can be used to address some of these issues. 

Six studies among the twelve identified by 
Holmes (19S9) provide the base of research evi- 
dence to be considered here. These studies allow 
identification of the basis of comparison as well 
as the treatment and focus on longitudinal effects. 



INSERT TABLE 1 



Table 1 provides the effect sizes for the various 
grade/subject/comparison basis combinations. 

Peterson, DcGracie and Ayabe (1987) report on a 
longitudinal study of the effect of grade retention 
in the Mesa Public Schools. The achievement 
effects of retention were compared for students 
matched on sex, age and previews CAT score. 
Achievement for four years in the areas of read- 
ing, language and math were compared. The 
program combined retention with remedies assis- 
tance. 

They carried out comparisons with same year and 
same grade peers separately for students by 
grade. The first comparisons look at students in 
the same year, but who are in ? different grade. 
The basis of comparison is identified in the table 
as year. Because this was a longitudinal study, 
the students are being compared at distinct time 
points as well, here identified asTl. T2, T3, T4 
and TS. The study followed the students for 
three years, so effect sizes for differences be- 
tween the retained and promoted groups are pre- 
sented for T1.T2 and T3. This says that the ef- 
fect size favoring retention using same year com- 
parison for grade 1 students on the CAT reading 
was 1.06. The average effect size for the nine 
comparisons for grade 1 across the different sub- 
tests on the CAT was +.42. When the students 
were compared on these same subtests at the 
same grade in school (sec the fourth block of re- 
sults), the average effect size across the three sub- 
tests was +.70. 

The same grade grade comparisons result in 
larger effect sizes than the same year compar- 
isons. But both are positive results, indicating a 
positive effect for retention. There is the typical 
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diminution of effects in time. The program was a 
retention 4- individual plan program, which sug- 
gests that the positive effect in the year compar- 
isons could be the result of more appropriate in- 
struction. Whether looking at grade or age com- 
parisons, the effects of retention were substantial 
and positive. However, both low-performing 
promoted and retained students were still per- 
forming below district level. 

The importance of longitudinal studies in examin- 
ing the effects of grade retention is underscored 
by Baenen (1988), who followed 243 matched 
retained or promoted students in a five year study 
(see second entry Table 1). The basis of this 
comparison was when students were in the same 
grade. For the first three years, the results favor 
the retained group over the piomoted group. The 
effect size across reading and math for the same 
grade comparisons for the first three years is 35. 
It is important that they extended the study past 
this time, however, as these positive effects even- 
tually fade and in fact become negative. The av- 
erage effect size across all five years is .15. The 
students received additional remedial services in 
this program. 

Dobbs and Neville (1967) matched 30 students 
who were retained in the first grade on sex, race, 
SES and reading achievement with students who 
were promoted. They compared the achievement 
in reading and m*th on the Metropolitan 
Readiness test at the same year and once the stu- 
dents were in the same grade. The average effect 
size for the same year comparison was -.75, indi- 
cating mat the promoted group, which was in the 
next grade, had higher achievement scores. The 
same grade comparisons, carried out over two 
time points, show an average effect size of 1.36 
favoring the retained students. However, this ef- 
fect size is inflated by the small number of cases 
and by the short duration of the study. The edu- 
cational practice employed was reexposure to the 
same material. 

Cof field (1954) compared the achievement of 
current seventh graders who had failed at some 
point during grades 3-7 with the achievement of 
students matched on the iTBS in the year prior to 
the retention. He compared the achievement of 
the two groups in the year of the retention (same 
year comparison) and when the students were in 
the same grade (same grade comparison). In the 
report, analyses were conducted by the year in 
which failure occurred. There was not a signifi- 
cant difference across year failed so we averaged 
the effect sizes to come up with a -.77 overall ef- 
fect size for the same year comparison and a -.17 



effect size for the same grade comparison. This 
suggests that once the students were in the same 
grade, there was no difference (an ES -.17 being 
classified as not significant). 

Oldham (1982) retrospectively matched current 
1 1th graders who were retained in primary grades 
on IQ, gender, and entry age with a group that 
had been promoted and compared their achieve- 
ment in reading and math at Die 3rd, 6th and 10th 
grades, lliis is a same grade comparison. He 
found that the retained students were significant 
higher in mathematics achievement and while 
higher in reading scores, were not significantly 
different. The effect sizes were .24, .11 and .11 
for reading and .37, .37 and .40 for math at 
grades 3, 6 and 10 respectively. The average 
effect size was .28. 

Voll rath (1983) compared a sample of those re- 
tained in K-3 with a matched sample of students 
who were recommended for retention but who 
were promoted instead. He matched on IQ and a 
cognitive abilities test. He compared their 
achievement at grades 3 and 6 (same grade com- 
parison) and found that the retained group was 
significantly higher than the promoted group. On 
the composite score of the ITBS, the effect sizes 
were .75 and 1.00 at grades 3 and 6 respectively. 

Wright ( 1979) matched 50 students who had been 
retained in the first grade with nonretained stu- 
dents on sex, IQ, parent's education and chil- 
dren's achievement scores in the first grade. He 
carried out a same grade comparison wh en th e 
children reached the third grade using the CTBS 
reading, language and math subtests. The aver- 
age effect size for these same grade comparisons 
was .29. It is not possible to identify the treat- 
ment from the available description. 

Collectively, these studies can be classified by 
educational treatment (remedial vs recycle), by 
whether the same year or same grade was being 
compared, and by the time of the comparison 
(immediately after retention or long term). In 
Table 2, we indicate the effect sizes for the rele- 
vant cell and which study contributed to the cell. 
It is hazardous to over generalize from this table as 
the number of studies is small. But, this classifi- 
cation suggests the following trends. 
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If we had ignored the type of program and com- 
pared students in the same year at L v * end of the 
year of retention, the average effect size would 
nave been about zero, leading us to conclude that 
retention made no difference. However, this 
conclusion masks the fact that the remedial pro- 
gram showed an average effect size of 4.85 while 
the two studies using recycling had an average ef- 
fect size of -.86, so that the average appears to be 
0. 



Certainly the importance of this research lies not 
in its total effects, but in the finding that different 
programs have different effects. Other important 
parts to this story are that the choice of year or 
grade comparisons will also influence the effect 
size, as same gratfe comparisons, quite expect- 
edly, are larger. Finally, because any positive 
effects of retention are likely to dimmish over 
time, it is important to monitor effects over time. 



Discussion 



What can be concluded about grade retention 
from this reexamination? Is the evidence so firm 
that we can safely say that no child will ever un- 
der any circumstances benefit from retention or 
even that most children wiP not benefit from re- 
tention? 

1 . Studies which compare students when they 
are in school for equal time (unequal grade) favor 
promoted students. 

2. Studies which compare students when they 
are in the same grade (unequal time* favor re- 
tained students or show no difference. 

3. Studies which present longitudinal compar- 
isons show that any positive effect of retention 
fades out over a two to three year period. 

4. Neither social promotion nor retention per se 
are effective at solving the problem of providing 
appropriate instruction for low performing stu- 
dents. The n :~arch has been phrased in such a 
way that a yes or no answer is called for. In fact, 
the main conclusion should be that both policies 



are failures. In most cases, doing better than the 
comparison group still meant a low level of per- 
formance relative to the school population at 
large. Retaining may not help, but simply pro- 
moting isn't a solution either. 

5. Promotion or retention with additional in- 
struction is more effective than either policy 

alone. 

6. The salient issue is not what policy to adopt 
(retention vs. social promotion) but how to pro- 
vide appropriate instruction given student diver- 
sity. But this has always been the central ques- 
tion for school organization. Rather than con- 
tinuing to argue over inappropriate questions, 
future research should devote attention to 
locating, developing, and evaluating effective 
organizational responses to differences in student 
abilities and competencies. The best that can be 
said about grade repetition vs social promotion as 
polices is that neither is a victor and that the 
children are the losers if we continue to debate the 
issue as it has been formulated thus far. 
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Table 1 
Retention/Promotion Studies 
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Gratl* 
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•oh Rdg Grade 61 

Math 
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r ^ * retention 
matched pair 
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Table 2 



Effect Sizes for Retention/Promotion Studies Classified by Comparison Basis, Educational 

Treatment, and Immediacy of Measurement 
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a Peterson, deGracie, Ayabe (1987) 

b Baenen.N (1988) 

c Dobbs and Neville (1967) 
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f Voelrath (1983) 

g Wright (1979) 
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